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ABSTRACT 

Active DNA demethylation is crucial for epigenetic 
control, but the underlying enzymatic mechanisms 
are incompletely understood. REPRESSOR OF 
SILENCING 1 (ROS1) is a 5-methylcytosine (5-meC) 
DNA glycosylase/lyase that initiates DNA 
demethylation in plants through a base excision repair 
process. The enzyme binds DNA nonspecifically and 
slides along the substrate in search of 5-meC. In this 
work, we have used homology modelling and bio- 
chemical analysis to gain insight into the mechanism 
of target location and recognition by ROS1 . We have 
found that three putative helix-intercalating residues 
(Q607, R903 and M905) are required for processing of 
5-meC:G pairs, but dispensable for excision of mis- 
matched 5-meC. Mutant proteins Q607A, R903A and 
M905G retain the capacity to process an abasic site 
opposite G, thus suggesting that all three residues 
play a critical role in early steps of the base extrusion 
process and likely contribute to destabilization of 5- 
meC:G pairs. While R903 and M905 are not essential 
for DNA binding, mutation of Q607 abrogates stable 
binding to both methylated and nonmethylated DNA. 
However, the mutant protein Q607A can form stable 
complexes with DNA substrates containing blocked 
ends, which suggests that Q607 intercalates into the 
helix and inhibits sliding. Altogether, our results 
suggest that ROS1 uses three predicted helix- 
invading residues to actively interrogate DNA in 
search for 5-meC. 

INTRODUCTION 

DNA methylation at carbon 5 of cytosine (5-meC) is a 
reversible epigenetic mark essential for cell differentiation 
and genome defense against transposable elements (1). 



Active DNA demethylation processes play a pivotal role 
in shaping methylation patterns, but the underlying mech- 
anisms are still incompletely understood (2,3). In plants, 
active DNA demethylation is initiated by a family of 
DNA glycosylases that specifically excise 5-meC and 
initiate its replacement with unmethylated C in a base 
excision repair (BER) process (2,4). No 5-meC DNA 
glycosylases have been unambiguously identified in 
animals, which may resort to BER of deaminated and/or 
oxidized derivatives of 5-meC to perform demethylation 
(5,6). 

Plant 5-meC DNA glycosylases are typified by 
Arahidopsis REPRESSOR OF SILENCING 1 (ROS1) 
(7-9), and its paralogs DME (DEMETER), DML2 and 
DML3 (DEMETER-like proteins 2 and 3) (8,10-12). 
ROS1, DML2 and DML3 counteract excessive methyla- 
tion at several hundred loci across the genome in vegeta- 
tive tissues (1 1-13). DME demethylates the maternal allele 
of imprinted genes in the endosperm (14), but its basal 
function appears to be the reactivation of transposons in 
companion cells to generate short interfering RNAs 
(siRNAs) that would reinforce transposon silencing in 
male and female gametes (15,16). ROS1, DML2 and 
DML3 may also contribute to such reactivation in male 
gametes (15). All four proteins are bifunctional enzymes 
with an associate lyase activity that cleaves the phospho- 
diester backbone at the 5-meC removal site by (3,8-elimin- 
ation, generating as a major product a single-nucleotide 
gap flanked by 3'-phosphate and 5'-phosphate termini 
(8,9,11,12,14). The DNA 3'-phosphatase zinc finger 
DNA 3'-phosphoesterase (ZDP) functions downstream 
of ROS1 by removing the blocking 3'-phosphate (17), 
and the repair protein XRCC1 stimulates 5-meC 
excision and facilitates 3'-end cleaning and DNA 
ligation (18). 

Plant 5-meC DNA glycosylases belong to the HhH- 
GPD superfamily (19), but they are distinctively 
characterized by a bipartite catalytic domain divided by 
a large insert predicted to have an unstructured 
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conformation (20). They also contain a carboxy-terminal 
domain of unknown function (8), and a short amino- 
terminal domain significantly rich in lysine (21). In 
ROS1, this basic domain mediates strong methylation-in- 
dependent binding (21) and endows the protein with the 
capacity to perform facilitated diffusion through random 
ID sliding along DNA (22). 

ROS1 and its homologs face the challenge to locate and 
excise a modified, but otherwise nondamaged, correctly 
paired base. Although no crystal structure is available 
for any 5-meC DNA glycosylase, the combination of 
sequence alignment and available structural data of 
HhH-GPD enzymes allows predicting functional residues 
that can be tested by site-directed mutagenesis. Most 
DNA glycosylases use a common base-flipping mechanism 
to extrude the target lesion from the base stack into a 
substrate-recognition pocket (23-25). The resulting distor- 
tion of the DNA is stabilized by insertion of a bulky 
intercalating side chain that plugs the vacant space left 
by the flipped-out nucleotide and a second side chain 
that wedges between the bases on the opposite strand 
(25). In a previous study (20), we used homology modeling 
and biochemical analysis to identify residues important 
for ROS1 function. Two amino acids predicted to be pos- 
itioned between the base stack and the recognition pocket 
(T606 and D611) were found to be essential for catalysis, 
whereas mutational changes in two aromatic residues pre- 
sumably located in the substrate-binding pocket (F589 
and Y1028) altered the base specificity of the enzyme. 
Our study also proposed Q607, which is essential for 
both catalytic activity and stable DNA binding, as a 
strong candidate for the plug residue that replaces the 
flipped 5-meC in the base stack (20). 

A central question that remains to be answered is how 
ROS1 locates its target base. In this work, we have built 
on our previous analysis (20) by investigating in detail the 
functional role of the putative plug residue Q607 and two 
other putative helix-intercalating amino acids (R903 and 
M905) that are predicted to contact the orphan G on the 
complementary strand. We have found that all three 
residues are specifically required for excision of 5-meC:G 
pairs, but dispensable for excision of mismatched 5-meC. 
We also found evidence that Q607, which is essential for 
stable methylation-independent DNA binding, slows 
down ROS1 sliding along DNA. Altogether, our results 
suggest that ROS1 performs sequential extrusion of every 
base for extrahelical interrogation while sliding along the 
DNA in search of 5-meC. 



MATERIALS AND METHODS 

DNA substrates 

Oligonucleotides used as DNA substrates (Supplementary 
Table SI) were synthesized by Operon or Integrated DNA 
Technologies (IDT) and purified by polyacrylamide gel 
electrophoresis (PAGE) before use. Double-stranded 
DNA substrates were prepared by mixing a 5-uM 
solution of a 5'-fluorescein-labeled oligonucleotide 
(upper-strand) with a 10-uM solution of an unlabeled 
oligomer (lower-strand), heating to 95°C for 5min and 



slowly cooling to room temperature. Annealing reactions 
for the preparation of the 1-nt gapped duplex were carried 
out at 95°C for 5min in the presence of a 2-fold molar 
excess of both unlabeled 5'-phosphorylated oligonucleo- 
tide (P30_51) and unlabeled oligonucleotide (CGR) with 
respect to the 5'-alexa-labeled 3'-phosphorylated oligo- 
nucleotide (A1-28P), followed by cooling to room tem- 
perature. DNA containing a natural AP site opposite 
guanine was prepared by incubating a DNA duplex con- 
taining a U:G mispair (200 nM) with 2.5 U of Escherichia 
coli Uracil DNA glycosylase (New England BioLabs) at 
30°C for 5min. Substrates SL1 and SL1-2 contain the 
same tetraloop obstacle at either one or both molecule 
ends, respectively. The obstacle is created by a 6-bp 
DNA helix capped at both termini with tetraloops, 
which is connected to the DNA substrate via a four-way 
junction (22). Both substrates were obtained by annealing 
two oligonucleotides of different lengths, as previously 
described (22). 

Production of ROS1 mutant versions 

Site-directed mutagenesis was performed using the Quick- 
Change II XL kit (Stratagene). The mutations were 
introduced into the expression vector pET28a (Novagen) 
containing the full-length wild-type (WT) ROS1 cDNA 
using specific oligonucleotides (Supplementary Table S2). 
Mutational changes were confirmed by DNA sequencing, 
and the constructs were used to transform E. coli BL21 
(DE3) dcm~ Codon Plus cells (Stratagene). WT and 
mutant versions were expressed and purified as N- 
terminal His-tagged proteins, as previously described 
(21,26). 

Enzyme activity assays 

Fluorescein-labeled duplex oligonucleotides (20 nM, 
unless otherwise stated) were incubated at 30° C for the 
indicated times in a reaction mixture containing 50 mM 
Tris-HCl pH 8.0, 1 mM EDTA, ImM dithiothreitol 
(DTT), O.lmg/ml bovine serum albumin (BSA) and the 
indicated amounts of WT ROS1 or mutant variant in a 
total volume of 50 ul. When reactions included AP endo- 
nuclease 1 (APE 1, 5U; New England BioLabs), EDTA 
was omitted and 5mM MgCl 2 was added. When 
measuring AP lyase activity, the reaction mixture was 
then incubated with 300 mM NaBH 4 at 0°C for 30min 
and neutralized with lOOmM Tris-HCl pH 7.4. 
Reactions were stopped by adding 20 mM EDTA, 0.6% 
sodium dodecyl sulphate and 0.5 mg/ml proteinase K, and 
the mixtures were incubated at 37°C for 30min. DNA was 
extracted with phenol:chloroform:isoamyl alcohol 
(25:24:1) and ethanol precipitated at — 20°C in the 
presence of 0.3 mM NaCl and 16ug/ml glycogen. 
Samples were resuspended in 10 ul of 90% formamide 
and heated at 95° C for 5min. Reaction products were 
separated in a 12% denaturing polyacrylamide gel con- 
taining 7 M urea. Fluorescein-labeled DNA was visualized 
in a FLA-5100 imager and analyzed using Multigauge 
software (Fujifilm). 
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Kinetic analysis 

ROS1 does not exhibit significant turnover in vitro owing 
to strong product binding (26), and therefore a simple 
Michaelis-Menten model is inadequate for a correct 
kinetic analysis of this enzyme. The standard reaction con- 
ditions were equimolar (20 nM) enzyme/substrate ratios 
and incubation at 30°C. Data were fitted to the equation 
[Product] = Pmaxtl - exp (_kt) ] using nonlinear regression 
analysis and the software Sigmaplot. For each mutant 
enzyme and substrate, the parameters P max (maximum 
substrate processing within an unlimited period of time), 
(the time required to reach 50% of the product plateau 
level, P max ) and the relative processing efficiency 
{E te \ = P msa JT 50 ) were determined (27). A representative 
example of 5-meC DNA glycosylase assay and kinetic 
analysis is shown in Supplementary Figure SI. 

Electrophoretic mobility shift assay 

In standard electrophoretic mobility shift assay reactions, 
increasing amounts of WT ROS 1 or mutant variants were 
incubated with lOnM fluorescein- or alexa-labeled duplex 
oligonucleotides, unless otherwise stated. Competition 
band-shift reactions were performed by preincubating 
WT ROS1 or mutant variants (130 nM) with lOOnM 
fluorescein-labeled substrates at 25° C for 5min and then 
adding increasing amounts of unlabeled duplex as com- 
petitor. DNA binding reactions were carried out at 25°C 
for 60min, unless otherwise stated, in 10 mM Tris-HCl, 
pH 8.0, ImM DTT, 10(.ig/ml BSA, 1 mM EDTA, in a 
final volume of lOixl. Complexes were electrophoresed 
through 0.2% agarose gels in lx TAE (40 mM Tris- 
HCl, pH 8.0, 20 mM acetic acid, ImM EDTA). 
Electrophoresis was carried out in lx TAE for 40min at 
80 V at room temperature. Fluorescein- or alexa-labeled 
DNA was visualized in a FLA-5100 imager and analyzed 
using Multigauge software (Fujifilm). 

RESULTS 

Identification of putative helix-intercalating residues 
in ROS1 

In E. coli Endonuclease III, the side chain of L82 serves as 
the wedge that intercalates into the DNA duplex and 
stacks with the G opposite the lesion (estranged G), 
which donates a hydrogen bond to the backbone 
carbonyl of 180 (28). The homologous residues of L82 
and 180 in ROS1 are M905 and R903, respectively 
(Figure IB). The residue R903 is conserved within the 
ROS1/DME family of DNA demethylases, whereas Q 
instead of M is observed at the homologous position of 
M905 in some members of the family (Supplementary 
Figure S2). The modeled structure of ROS1 (20) predicts 
that M905 and R903 are positioned close to the G 
opposite 5-meC (Figure 1C). To test the prediction that 
M905 and R903 have a role in ROS1 enzymatic activity, 
we mutated them to Gly (M905G) and Ala (R903A), re- 
spectively. In our experiments, we also used the mutant 
version Q607A, which lacks the putative plug residue (20). 



Q607, R903 and M905 are specifically required for 
processing of 5-meC:G base pairs 

We first examined the ability of WT and mutant proteins 
M905G, R903A and Q607A to process a 51-mer duplex 
oligo substrate containing a single 5-meC opposite G, A, T 
or C (Figure 2). Consistent with our previously reported 
observations (26), we found that WT ROS1 processed 
5-meC with higher efficiency when mispaired with either 
C, T or A than when paired with G. Mutants M905G and 
R903A had no detectable base excision activity on a 
5-meC:G pair, whereas the Q607A mutant displayed a 
strongly reduced activity on the same substrate 
(Figure 2). Interestingly, however, all three mutant 
proteins retained a significant activity on DNA substrates 
containing a mismatched 5-meC. In contrast, two catalyt- 
ically disabled mutants (D611V and T606L) did not show 
detectable base excision activity on either paired or 
mispaired 5-meC (data not shown). Therefore, Q607, 
R903 and M905 are specifically required for efficient 
excision of 5-meC opposite G. 

These results strongly suggest significant differences in 
recognition and processing of 5-meC:G pairs and 5-meC 
mismatches. To explore further such possibility, we 
examined the temperature dependence of WT ROS1 
activity on both types of substrates (Supplementary 
Figure S3). The catalytic activity on all four DNA sub- 
strates increases with temperature from 5 to 30°C, but the 
temperature dependence is significantly lower for the 
DNA substrate containing a 5-meC:G pair. 

ROS1 is a bifunctional DNA glycosylase/lyase that 
catalyzes both the release of 5-meC and the cleavage of 
DNA at the resulting abasic site (8). We therefore asked 
whether the incapacity of the three mutant proteins to 
process 5-meC:G pairs is due to a deficiency in DNA 
glycosylase activity, lyase activity or both (Figure 3). To 
detect DNA glycosylase activity, we analyzed the reaction 
products generated by different ROS1 variants after an 
additional alkaline treatment with NaOH, which cleaves 
all abasic sites generated by the enzyme and reflects 5-meC 
excision. We found that R903A and M905G mutant 
enzymes did not generate detectable incision products, 
whereas the Q607A variant showed a significantly 
decreased activity (Figure 3B). Analogous results were 
obtained when performing reactions in the presence of 
human APE1 (data not shown). We next tested whether 
the three variants retained AP lyase activity by incubating 
the proteins with a 51-mer duplex oligo substrate contain- 
ing an AP site opposite G. We found that all three mutant 
variants cleaved the abasic site as efficiently as WT ROS1 
(Figure 3C). These results indicate that R903A, M905G 
and Q607A exhibit a specific defect in catalysis of 
glycosylic bond cleavage. Because such defect is greatly 
alleviated when the target base is mispaired, we conclude 
that Q607, R903 and M905 are critical for destabilization 
of 5-meC:G base pairs and are likely to perform a key role 
in extrusion of the target base from DNA. 

R903 and M905 are dispensable for DNA binding 

We have previously reported that the Q607A mutant 
exhibits a drastically reduced DNA binding capacity 
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Figure 1. Identification of putative helix-invading residues in ROS1. (A) Schematic diagram showing conserved regions among members of the 
ROS1/DME family: a N-terminal lysine-rich region (green), a noncontiguous DNA glycosylase domain distributed over two segments (blue and red) 
separated by a nonstructured linker region and a highly conserved C-terminal domain (yellow) that is not found in any other protein family. 
(B) Multiple sequence alignment of part of the DNA glycosylase domain of ROS1/DME proteins and several HhH-GPD superfamily members. 
ROS1 amino acids analyzed in this work are indicated by inverted triangles and highlighted in orange (Q607), green (R903) or pink (M905). Names 
of organisms are abbreviated as follows: Ath, Arabidopsis thaliana; Nta, Nicotiana tabacum; Bst, Bacillus stearothermophilus; Eco, Escherichia coli 
Mth, Methanobacterium thermoautotrophicum; Mmu, Mas masculus; Hsa, Homo sapiens. Genbank accession numbers are as follows: Ath ROS1 
AAP37178; Ath DME: ABC61677; Nta ROS1: BAF52855; Bst EndoIII: 1P59; Eco EndoIII: P20625; Mth Mig: NP_039762; Eco MutY: NP_417436 
Mmu MBD4: 1NGN; Hsa OGG1: 015527; Eco AlkA: P04395. (C) Structural model for the DNA glycosylase domain of ROS1 bound to a DNA 
containing an abasic site. The position of Q607, R903 and M905 residues (colored as in panel B) in relation to the estranged guanine (blue) are 
shown. The model was generated as described in (20) and the figure was prepared with PyMOL (http://www.pymol.org). (D) Schematic sequence 
diagram indicating the predicted interactions between mutated amino acids and the orphan guanine (blue) opposite 5-meC (M, in red). 



(20). We therefore decided to examine whether R903 and/ 
or M905 also play a role in substrate binding. We 
incubated increasing concentrations of WT ROS1, 
R903A, M905G or Q607A proteins with a labeled DNA 
substrate containing a 5-meC:G pair (Figure 4). No 
protein-DNA complexes were detected in binding reac- 
tions containing the mutant protein Q607A. In the case 
of WT ROS1, R903A and M905G proteins, we observed a 
major band with retarded mobility at high protein concen- 
trations and a minor diffuse band, exhibiting higher 
mobility that was detectable at low protein concentra- 
tions. Analogous results were obtained with 
nonmethylated DNA (data not shown). These results 
suggest the formation of protein-DNA complexes con- 
taining more than one ROS1 molecule. A similar 



observation has been previously reported for a mamma- 
lian homolog of the DNA glycosylase MutY (29). 
Quantitation of the total amount of protein-DNA 
complexes (Figure 4, left) indicates that there are not sig- 
nificant differences in DNA binding capacity between WT 
ROS1 and either R903A or M905G proteins. 

We next examined in detail the binding affinity of 
WT ROS1, R903A and M905G through competition 
experiments with unlabeled oligonucleotides (Figure 5). 
In agreement with its methylation-independent DNA 
binding capacity (21), WT ROS1 binding to a methylated 
DNA probe was reduced with equivalent intensity 
when incubated with increasing amounts of either 
nonmethylated or methylated unlabeled competitor 
DNA (Figure 5, upper panel). We found that the two 
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mutant proteins exhibited dissociation rates not signifi- 
cantly different from those of the WT protein, and both 
bound methylated and nonmethylated DNA with similar 
affinity (Figure 5, middle and lower panels). We also 
found that the two mutant versions do not show any 



enhanced affinity for DNA substrates containing mis- 
matched 5-meC, or for a single nucleotide gap opposite 
A, T or C (Supplementary Figure S4). Altogether, these 
results indicate that R903 and M905 are dispensable for 
DNA binding. 
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Figure 2. Enzymatic activity of WT ROS1 and mutant variants on 
DNA substrates containing 5-meC opposite different bases. Purified 
proteins (20 nM) were incubated at 30°C with 51-mer double-stranded 
oligonucleotide substrates (20 nM) containing 5-meC opposite G, A, T 
or C, as indicated. Relative processing efficiencies were determined in 
kinetic assays as described in 'Materials and Methods' section. Values 
are mean ± SE (error bars) from three independent experiments. 



Q607 inhibits ROS1 sliding along DNA 

The results discussed above indicate that residues Q607, 
R903 and M905 are required for excision of 5-meC 
opposite G, but only the former is critical for stable 
DNA binding (20). We have suggested that Q607 is the 
plug residue used by ROS1 to flip out 5-meC and com- 
pensate its extrusion by filling in the vacant space in the 
DNA base stack (20). Because Q607 is required for stable 
binding to both methylated and unmethylated DNA (20), 
we hypothesized that ROS1 performs extrahelical interro- 
gation of unmethylated base pairs by insertion of this 
residue into the DNA helix. 

We have recently reported that ROS1 performs sliding 
on DNA while searching for its target base (22). 
Therefore, we reasoned that if Q607 plays any role in 
DNA interrogation, the absence of this residue in the 
mutant protein should have an effect on DNA sliding. 
To examine this possibility, we compared the diffusive 
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Figure 3. Q607A, M905G and R903A mutant proteins lack DNA glycosylase activity on 5-meC:G pairs but retain AP liase activity. (A) Schematic 
diagram of ROS1 DNA glycosylase/AP lyase activity on 5-meC. ROS1 excises 5-meC as a free base and then cleaves the phosphodiester backbone at 
the 5-meC removal site by successive (3,5-elimination. (B) DNA glycosylase assay. The generation of incision products was measured by incubating 
purified WT ROS1 or mutant variants (20 nM) at 30°C for 4h with a double-stranded oligonucleotide substrate (20 nM) containing a single 5-meC:G 
pair. After incubation, NaOH (100 nM) was added and samples were immediately transferred to 90°C for lOmin. Products were separated in a 12% 
denaturing polyacrylamide gel and the amounts of incised oligonucleotide were quantified by fluorescent scanning. (C) AP liase assay. A double- 
stranded oligonucleotide substrate containing an AP site opposite G (20 nM) was incubated at 30°C for 2h in the presence of purified WT ROS1 or 
mutant variants (20 nM). Samples were treated with NaBH 4 (300 mM) at 0°C for 30min to stabilize nonprocessed AP sites and neutralized with 
100 mM Tris-HCl, pH 7.4. Products were separated in a 12% denaturing polyacrylamide gel and the amount of incised oligonucleotide was 
quantified by fluorescent scanning. Values are mean ± SE (error bars) from three independent experiments. 



Nucleic Acids Research, 2013, Vol. 41, No. 18 8659 




0 20 40 60 80 100 120 140 




0 20 40 60 80 100 120 140 




0 10 20 40 60 80 100 120 140 nM Protein 



r i 



Free DNA 



< 

CO 

o 



. • m m m 

PI 1 



—Free DNA 



o 

















M 

1* 







—Free DNA 



0 20 40 60 80 100 120 140 





-Free DNA 



0 20 40 60 80 100 120 140 
Protein concentration (nM) 

Figure 4. DNA binding capacity of R903A, M905G and Q607A mutant proteins. Increasing concentrations of purified WT ROS1 or mutant 
variants were incubated at 25°C for lh with 10 nM of fluorescein-labeled 5-meC:G duplex. After nondenaturing gel electrophoresis, gels were 
scanned to detect fluorescein-labeled DNA. Protein-DNA complexes were identified by their retarded mobility compared with that of free DNA. 
A representative gel is shown for each protein. Graphs on the left show the percentage of protein-DNA complex versus protein concentration. All 
bands with slower mobility were used in quantitation of bound protein. Values are mean ± SE (error bars) from three independent experiments. 



behavior of WT ROS1 and Q607A on a substrate con- 
taining tetraloop obstacles along the DNA surface 
(Figure 6). We preincubated WT ROS1 or Q607A with 
labeled substrates S or SL1-2 and then added increasing 
concentrations of unlabeled S competitor to promote dis- 
sociation. Consistent with our previously reported obser- 
vations (22), we found that WT ROS1 dissociates from 
substrate S when chased by the competitor, but remains 
bound to substrate SL1-2 even at high competitor con- 
centrations. As expected, Q607A was unable to bind sub- 
strate S. However, the mutant protein was able to form a 
stable complex with substrate SL1-2, resisting competi- 
tion with increasing concentrations of unlabeled S 
(Figure 6). By performing DNA binding measurements 



at different time points in the absence of competitor, we 
detected stable complexes of WT ROS1 with both S and 
SL1-2, whereas Qf507A only bound stably to SL1-2 
(Figure 7). Furthermore, we found that unblocking one 
of the substrate ends greatly reduced the capacity of 
Q607A to form a stable complex with DNA 
(Supplementary Figure S5). We therefore conclude that 
the Q607A variant is unable to form a stable complex 
with a DNA substrate containing free ends, but remains 
bound to a molecule whose ends are obstructed with 
tetraloop blocks. Importantly, we found that the stable 
binding of Q607A to substrate SL1-2 did not have any 
positive effect on the reduced catalytic activity of 
the mutant protein (Supplementary Figure S6), thus 
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Competitor molar excess 

Figure 5. Binding of R903A and M905G mutant proteins to methylated and nonmethylated DNA. WT ROS1 (upper panel), R903A (center panel) 
or M905G (lower panel) proteins (130 nM) were incubated at 25°C for 1 h with 100 nM labeled methylated DNA containing a single 5-meC:G pair, 
in the presence of increasing amounts (0, 0.1, 0.25, 0.5 and 1.0 uM) of either methylated (lanes 2-6) or nonmethylated (lanes 2 and 7-10) unlabeled 
competitor DNA. After nondenaturing gel electrophoresis, the gel was scanned to detect fluorescein-labeled DNA. Protein-DNA complexes were 
identified by their retarded mobility compared with that of free DNA, as indicated. Graphs on the left show the percentage of remaining complex 
versus competitor molar excess ratios. Values are mean ± SE (error bars) from three independent experiments. M indicates 5-meC. 



corroborating the essential role of Q607 for base 
excision. Altogether, these results indicate that Q607 
has an inhibitory effect on ROS1 sliding along DNA, 
and suggest that the enzyme extrudes unmethylated 
bases for interrogation by inserting this residue into the 
base stack. 



DISCUSSION 

The conserved base-flipping mechanism used by DNA 
glycosylases entails the need to stabilize the resulting dis- 
tortion of DNA. Central to this process is the intercalation 
of two residues into the base stack: a plug residue that fills 
the gap left by the flipped-out nucleotide and a wedge 
residue that intrudes between bases on the opposite 
strand and interacts with the orphan base. Our 
homology modeling and functional analysis of ROS1 
suggest that Q607 and M905 serve the roles of plug and 
wedge residues, respectively, with R903 additionally per- 
forming a critical function in the stabilization of the 
orphan G. In this work, we have analyzed the function 
of these three putative DNA intercalating residues in ini- 
tiation of DNA demethylation by ROS1 glycosylase. 

We have found that Q607A, M905G and R903A 
variants fail to process 5-meC:G pairs, but remain com- 
petent in cleaving an abasic site opposite G. These results 
demonstrate that mutations in any of these three residues 
specifically impair early 5-meC recognition events, leaving 



intact the capacity to perform downstream steps in the 
base excision cascade. Similar separation of function mu- 
tations have been reported in other bifunctional DNA 
glycosylases. For example, blocking the 8-oxoG recogni- 
tion pocket of hOGGl with a bulky amino acid side 
chain generates a mutant variant unable to flip 8-oxoG 
nucleotides in DNA, but still able to recognize and 
cleave abasic sites (30). Interestingly, mutation of two 
DNA intercalating residues in Fpg (also called MutM) 
(F114A and M77A) significantly decreases 8-oxoG 
excision but preserves AP lyase activity (31). Although 
our results do not exclude the possibility that Q607A, 
M905G and/or R903A may facilitate AP site hydrolysis 
following 5-meC excision, they suggest that the major 
catalytic defect in the three mutant proteins relates to 
upstream steps of the base extrusion pathway. 

WT ROS1 excises 5-meC more efficiently from 
mismatches (26), which suggests that facile extrusion 
from the helix plays a critical role in 5-meC excision. 
The ROS1 model structure predicts that Q607, M905 
and R903 are candidates to intercalate in DNA, and there- 
fore they might play a role in destabilization and extrusion 
of 5-meC opposite G. This hypothesis is supported by the 
remarkable observation that all three mutant ROS1 
proteins retain a significant 5-meC excision activity when 
the target base is mispaired. Increased base excision 
activity by disruption of proper base pairing has been pre- 
viously reported for several glycosylases (32-34). UDG 
has been proposed to capture spontaneously extruded 
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identified by their retarded mobility compared with that of free DNA, as indicated. Graphs on the left show the percentage of remaining 
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uracils (35), and displays higher activity when its target 
base is mispaired with an unnatural adenine analog that 
lacks Watson-Crick hydrogen-bonding groups (33). DNA 
glycosylases that use an active base-flipping mechanism, 
such as Fpg and human AAG, also excise their target 
bases with higher efficiency when mispaired with 
nonpolar analogs (32,34), an effect that has been 
attributed to facilitated base extrusion from the helix. 
Interestingly, the catalytic activity of human AAG 
mutated in the plug residue Y162 was partially restored 
when the base opposite the target was changed to a 
nonhydrogen bonding partner (34). This result mirrors 
our observation that variant Q607A retains a significant 
activity on mismatched 5-meC, thus suggesting that 
facilitated extrusion of the target base partially bypasses 
requirement for helix-invading residues. 

Insertion of the plug residue into the gap left by the 
extruded base is likely to play a crucial role in stabilizing 



the lesion-recognition complex. In fact, it has been con- 
sistently observed in different DNA glycosylases that 
mutation of the plug residue not only curtails catalytic 
activity, but also abrogates stable binding to DNA sub- 
strates (34,36,37). However, there is growing evidence 
that, in addition to stabilizing the extrahelical state after 
base flipping, side chains that intercalate into DNA play a 
critical role during target location and substrate specificity 
(31,38-40). The results described here strengthen the 
hypothesis that ROS1 uses Q607 as a plug residue for 
efficient 5-meC extrusion. In addition, they suggest that 
ROS1 requires such residue for extrusion of both 
methylated and unmethylated bases in a DNA scanning 
process. There are several observations that, taken 
together with the stable binding of Q607A to a DNA sub- 
strate with blocked termini, support the idea that Q607A 
is not just a DNA binding-deficient mutant, but a faster 
slider with decreased extrusion rates on both methylated 
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and unmethylated bases. WT ROS1 binds with similar 
affinity to methylated and nonmethylated DNA, whereas 
Q607A does not show detectable binding activity on either 
substrate. However, despite such undetectable binding 
capacity, Q607A exhibits a significant activity on targets 
that do not require full extrusion competence, such as 
mismatched 5-meC and AP sites, and displays excision 
rates similar to those of the binding-proficient versions 
M905G and R903A. In contrast, the increased binding 
of Q607A to a DNA substrate with blocked ends does 
not result in an increased activity on 5-meC:G pairs 
(Supplementary Figure S6), which suggests that unstable 
DNA binding is a derived consequence of a deficiency in a 
putative plug residue. We therefore propose that the 
unstable DNA binding exhibited by the Q607A mutant 
is most likely due to increased sliding along DNA, thus 
implying that the side chain of Q607 inhibits the scanning 
rate of ROS1. The role of helix-invading residues in 
controlling motion of a DNA glycosylase along DNA 
is not without precedent. In Fpg, which performs 
intrahelical interrogation (39), mutations of the wedge 
(F114) and plug (R112) residues increase the diffusion 
rate (40) and induce strandwise translocation (41), 
respectively. 

Our results shed some light on the possible mechanism 
used by ROS1 to locate its target base. Three possible 
strategies have been proposed to explain how DNA 
glycosylases find their target bases on DNA: (i) passive 
capture of extrahelical target bases that have spontan- 
eously emerged from the DNA base stack (35), (ii) 
intrahelical inspection of the relative strength and flexibil- 
ity of base pairs by inserting one or more residues into the 
DNA duplex (38,42) and (hi) sequential extrusion of every 
base out of the DNA helix for extrahelical interrogation 



(43). Because methylation has been shown to decrease 
base opening rates (44), it is unlikely that ROS1 passively 
captures 5-meC residues spontaneously emerged from the 
DNA base stack at 5-meC:G pairs. By other hand, the 
inhibitory role of Q607 on ROS1 sliding along DNA 
suggests that base flipping precedes substrate discrimin- 
ation, and that the enzyme actively interrogates bases on 
an extrahelical rather that intrahelical conformation. 
Intrahelical inspection may be rather inefficient to distin- 
guish between C and 5-meC. Methylation does not induce 
gross conformational changes in DNA (45) and the 
5-methyl group is located on the major groove, whereas 
the catalytic domain of ROS1, as in all other DNA 
glycosylases (23), is expected to bind DNA through the 
minor groove. Intriguingly, both the DNA-binding 
domain of the methyl-specific endonuclease McrBC from 
E. coli (46) and the SET- and Ring-associated (SRA) 
domains of mammalian UHRF1 (47) and Arabidopsis 
SUVH5 (48) flip out 5-meC from the base stack to 
achieve discrimination of 5-meC from C. In any case, a 
full understanding of the search mechanism used by ROS1 
must await structural information for this enzyme. 
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